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Abstract Mobile phone communication as a digital service generates ever- 
increasing datasets of human communication actions, which in turn allow 
us to investigate the structure and evolution of social interactions and their 
networks. These datasets can be used to study the structuring of such ego- 
centric networks with respect to the strength of the relationships by assuming 
monotonic dependence of the communication intensity on the strength of the 
social tie. Recently we have discovered that there are significant differences 
between the first and further "best friends" from the point of view of age 
and gender preferences. Here we introduce a control parameter, namely the 
minimum ratio r m ; n between the intensities of communication of the first and 
second "best friend" and use it to filter the data. We find that when r min is 
increased the identification of the " best friend" becomes less ambiguous and 
the earlier observed effects get stronger, thus corroborating them. 
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1 Introduction 

Information Communication Technology (ICT) has been (and still is) provid- 
ing a plethora of new services for every individual in society to use, which then 
means that every user transaction is recorded as a kind of "digital footprint" . 
These footprints of ours form ever-growing datasets, such as those of mobile 
phone communication (MPC) of operators, the access to which can open 
up quite unparalleled views on social interactions between individuals on a 
societal scale and, in general, to the structure and dynamics of the society 
PP. So studying these datasets through computational analysis and model- 
ing gives us insight to the system of social interactions [2] , which is different 
from, but at the same time complementary to, that based on questionnaires. 
The main differences are that the latter approach is wide in its social scope 
yet having quite ill-defined and subjective scale of social closeness, while the 
former, e.g. that of MPC, is narrow in its social scope consisting only two 
communication channels, voice calls and text messages (SMS), measured by 
their frequencies or in case of calls also by duration. There are by now some 
good examples of what can be learned and understood about these systems 
and their behavior by using modern data and reality mining methods [3] as 
well as other computational analysis and modeling approaches [4]. 

The closeness of social relationship is one of the fundamental issues in 
sociology and it is crucial in the extended social brain hypothesis [5], which 
states that the capacity of forming social ties is limited to 150 by the brain 
capacity due to human evolution. Within this "Dunbar circle" further order- 
ing takes place. The number of very close relationships is very small (3-5) [5J, 
and in order to be able to maintain them a new close relationship unavoidably 
tends to make an older one weaker [7]. 

In a recent study [5] we have analyzed a seven months period of mobile 
phone records of 3.2 million subscribers from a European provider. We show 
that there are striking differences between the gender preferences of the " best 
friend" (defined by the most frequently contacted partner in the egocentric 
network) and relationships of lesser ranks. The best friend is found to be 
of the opposite sex for ages less than 50 years with a peak around 30. The 
peak for females is higher and appears earlier or at a little younger age than 
for males. Above 50 there is a further difference between males and females: 
While males still prefer females as best friends, females have a slight prefer- 
ence to the same gender. The situation is very different for the second and 
further best friends. Below 50 gender homophily can be observed, but above 
that age males tend to be gender neutral in their selection, while females 
have a tendency to select males. These observations have anthropological in- 
terpretation in terms of gender dependent reproductive investment and they 
demonstrate that our actions are largely motivated by basic evolutionary 
aspects even for the usage of ICT. 

The weak point in the above reasoning is that the ranking of "friends" 
is taken as uniquely determined by the order of MPC intensities or the fre- 
quency of contacts. While communication intensity in general could be a 
good measure of social tie strength [ MTtJlITT] MPC offers only two channels 
of communication among many other possibilities, like face to face, email, so- 
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cial network sites etc. Also there are person to person variations with respect 
to the usage of these channels. By the same token the set of close relation- 
ships for an ego identified on the basis of frequency of calls is not always the 
set of friends an ego spends the most time with in communication. Indeed, 
studying MPC records (throughout this paper we use the same dataset as 
in [8]) we show that different ways to determine the strength of a social tie 
may suggest different persons as the best friends. This raises the question of 
reliability for to identify the closest relationships. In this study we introduce 
a simple parameter and show that it allows us to single out the egos with 
reliably identified best friend and thus to control the corresponding reliability 
level. Our analysis shows that the more reliably identified the best friend the 
more clearly his or her privileged status is observed and the more pronounced 
gender difference in communication found. 

After this introduction we present a section on the identification of the 
close relationships and sections regarding gender preferences and age corre- 
lations. Then we end with the section of conclusions. 



2 Filtering of the closest relationships 

The intensity of communication may have different measures depending on 
the communication channel. Even within one channel there could be several 
characteristics such as for phone calls the number of (outgoing or bidirec- 
tional) calls, (outgoing and bidirectional) call durations. This makes the de- 
termination of the strength of social ties difficult even with the assumption 
that there is a monotonic relationship to the intensity of communication. In 
order to learn how these distinct characteristics may affect our results of close 
social relationships, we first assume that the number of calls by an ego to 
his or her friends can serve as a simple measure of the strength of these rela- 
tionships. Then the best friend of an ego could be defined as the alter whom 
the ego calls most frequently, the second best friend as the subscriber (alter) 
whom the ego calls the second most frequently and so on for the third, fourth 
etc. best friend. This means that the order of the closeness of relationships 
for each ego is defined by the number of outgoing calls. 

Another possible measure of the intensity of communication and, con- 
sequently the closeness of the relationship, is the duration of the calls the 
ego makes. There is an overall positive correlation between call frequency 
and call duration [12] , thus one may expect that duration is an equally good 
measure to determine the "best friend" of an ego. However, this is not always 
the case. Our results show that the best friend, identified on the basis of the 
frequency of calls and on the basis of duration of calls coincides for about 
70% of egos, but in 30% of cases these definitions give different results. 

To overcome this, we suggest that the level of reliability level in identifying 
the best friend for an ego i depends on the ratio 

ri = nii/nn (1) 

between the number of calls n%i by the ego i to his/her best friend and 
to the second best friend ri2%- The reason behind this assumption is that 
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large r, implies preferential use of this communication channel as the overall 
time for communication between people is limited. Then establishing the 
threshold value r m j n and considering the egos with r; > r m j n one may expect 
to achieve a higher reliability level for the best friend identification. Of course, 
this filtering reduces the sample size (as depicted in Fig. [T]), but not too 
much even for the large values of r m j n . In Figs. [2] a and b we show the 
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Fig. 1 Number of subscribers (egos) whose communication patterns satisfy the 
condition n > r m i n . Note that although this filtering procedure reduces statistics, 
there still remains more than 70 000 egos left even for r m i n = 10. 



results of comparisons between five different definitions for to identify the 
best friends of egos. Here we see a clear increase in the percentages of egos 
for whom the different definitions lead to the same individual as the best 
friend. In particular, in Fig. |a we see that by setting the value of the 
control parameter r m - m = 2, the fraction of remaining egos for whom both 
the frequency and duration of calls lead to the same alter as the best friend 
increases up to 85%, thus reducing the fraction of ill-defined cases by a factor 
of 2. Increasing the value of the control parameter r mm beyond 2 reduces 
the fraction of ill-defined cases even further leading to very high degree of 
coincidence between all but one of the definitions best friend alter for an 
ego, namely the definition using only text messages. It is indeed evident in 
Fig. [2^i that there is comparatively low level of overlap or coincidence between 
voice calls and text messages as separate channels of communication. This 
may be caused by these two channels serving different functions in human 
communication. In Fig. [2)d we show the relative reliability of identifying the 
best friend alter of an ego as a function of the control parameter r m ; n for 
five different definitions all including voice calls. These five curves overlap 
indistinguishably, thus leading to same alter as the best friend with very 
high degree of reliability. 

We can conclude here that a simple parameter r m - m allows us to control 
the level of reliability in the identification of the best friend by selecting those 
subscribers for whom several ways to define the closest relationship lead to 
the same person. In the next section [3] we consider whether the patterns 
observed in |5] are maintained with this change of reliability level. We also 
investigate the gender differences in mobile communication. 
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Fig. 2 Coincidence between different definitions to identify the best friend of an 
ego and its reliability. Panel a: probability that the best friend defined on a basis of 
the number of outgoing calls with given value of r m i n coincides with the best friend 
defined by using other five different criteria: number of calls and text messages 
over a link (red line), total duration of calls with the friend (green line), duration 
of outgoing calls (blue line), total number of calls over a link (purple line) and 
number of text messages sent by an ego (among those egos who used text messages 
at all) (light blue line). Apart from using only text messages in the definition all 
other curves coincide quite closely. Panel b: reliability of identifying the best friend 
for given value of r m i n , defined as the probability that the five different ways to 
single out the best friend, i.e. based on the number of out-calls, number of calls 
over a link (in + out), duration of out-calls, total duration of calls, and number of 
calls and text messages over a link, lead to the same alter as the best friend of an 
ego. 



3 Gender differences 

It has been shown [5] that according to gender correlations only one of all the 
friends has a special status: only the closest relationship demonstrates a clear 
tendency to be characterized by the opposite gender to that of an ego. This 
bias turned out to be highest during the reproductively active period. All 
other friends who are of lower ranks or less close show considerably smaller 
gender bias towards the ego's own gender. 

In order to analyze the evolution of this relationship with r m ; n we assign 
a gender variable gi for each subscriber i in such a way that gi = ±1 for male 
and female subscribers, respectively. Defining fi to be a gender of the best 
friend for subscriber i, we get the average gender (/) of the best friend as 
follows: 

(/) = |fy- (2) 

Here i runs over all egos, subject to given restrictions. If the restriction is 
taken conditional to gender of the egos, we get the average gender of the best 
friend for males and females, separately. Then if we include the threshold r m \ n 
as an additional restriction, our original finding [8] about the special status 
of the best friend of an ego gets corroborated. This is clearly seen in Fig. [3j 
where we show the average gender of the best friend, second best friend, and 
the third best friend as a function of r m i n . 
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Fig. 3 Average gender of the best friend (panel a), second best friend (panel b) 
and the third best friend (panel c) as a function of r m i n for males (blue squares) 
and females (red balls). The special status of the best friend is demonstrated by the 
strong bias towards the opposite gender, which is in contrast to less bias in the same 
gender relationships for the second and third best friends. The strengthening of the 
opposite gender preference for best friend as a function of the control parameter 
r m i n verifies our previous findings [8]. 



In order to look more deeply into the gender preferences and to check our 
previous findings about the gender dependent differences in the reproductive 
investment, we consider the average gender of close relationships as a function 
of the age and gender of the ego. 

The average gender of the best friend, second best friend, and the third 
best friend as a function of the age of ego are shown in Fig. [4] for males 
and females separately, showing high level of gender-age correlations. While 
there is no significant dependence in the curves describing the second and 
thirds best friends as a function of r mnl , the special status of the best friend 
becomes again apparent. In fact, we observe that i) females have a stronger 
bias towards males during their reproductive period than the males towards 
females; ii) this changes at around the age of 50 when there is some bias for 
males but not for females. The effects becomes stronger with increasing r m i n , 
supporting our pervious findings obtained without the threshold. 

In more quantitative terms, for r m j n = 1 the highest absolute value for 
the best friend gender for female |(/)| « 0.46 (meaning 72 males as the 
best friends among 100) exceeds the corresponding value for males (|(/)| ~ 
0.41). The change with r m j n of the highest absolute value of the best friend 
gender bias is shown in Fig. [5] a . It shows that the effect of opposite sex 
relationships becomes more evident both for males and females if the best 
friend is defined more reliably. Moreover, Fig. [5}3 giving the ratio between 
peak values for females and males demonstrates that the difference between 
males and females becomes more pronounced with r m ; n . 

As a result we can see that using a threshold r m ; n for controlling the 
reliability of identifying the best friend actually improves our earlier results 
about the gender preference in selecting the best friend as well as the special 
status of the ego having for opposite gender. Similarly the very different 
strategies for reproductive investment for males and females became even 
stronger when we increased the value of the threshold r m - m . 
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Age of ego 

Fig. 4 Average gender of the best friend (left panel column), second best friend 
(central panel column) and the third best friend (right panel column) as a function 
of the age of an ego for different values of the control parameter r m i n . Blue squares 
correspond to male egos and red balls to female egos. 
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Fig. 5 Panel a shows the highest (absolute) value of the average gender of the best 
friend, for males (blue squares) and females (red circles). Panel b shows the ratio 
between these values for females and males as a function of the control parameter 
?"mm- The increasing tendency of this ratio corroborates our previous finding that 
females are more focussed on opposite sex relationships than males during their 
reproductively active period. 
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Apart from the role gender plays in choosing the best friend alter of an 
ego, also age plays an important role for choosing the set of close relation- 
ships [8J. In the next section [4] we will consider the corresponding effect of 
age and will examine how the results obtained for gender preferences and 
their changes with r m j n are reflected as changes in the age structure of close 
friendships. 



4 Age correlations 

In our earlier study [5] we investigated the influence of the ego's age on the age 
distribution of the best friends. A strong effect was observed, such that while 
the distribution turned out to be always bimodal with a distance between the 
peaks corresponding to the generation gap, the peaks corresponding to the 
parents generation was much smaller in the case of the young people than 
the other way around. Moreover, females older that 50 years of age had a 
preference towards females of the young generation, possible their daughters. 
This led us to conclude that there is a shift in the reproductive investment 
strategy of females towards the daughters (and through them, towards the 
grandchildren) after the onset of the menopause. In Fig. [6] the distribution of 
best friend's age for r min = 2 is shown for the cases when the ego is 25 and 50 
years of age. This figure shows that independently of the age and gender of 
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Fig. 6 Age distribution of the best friend for 25 years old male egos (panel a), 25 
years old female egos (panel b), 50 years old male egos (c), and 50 years old female 
egos (d). Blue squares and red balls correspond to male and female best friends, 
respectively. Here the control parameter is set r m - m = 2. 

the ego the strong bias toward the opposite-sex relationships appears around 
ego's own age, but for the age difference of generation gap this bias for 
opposite sex is more balanced. Nonetheless, for 50 years of age females there 
is a significant preference towards females (daughters). The bi-modality of 
this distribution and different gender preferences around each mode leads us 
to suggest different dependence of these relationships on r m i n . 
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In order to investigate the evolution of the age structure of close friends 
we will take the following approach. First, we single out those best friends 
whose age a/ is similar to the age a of an ego af € [a — Aa, a + Aa], where 
Aa defines the width of age interval. Then we select those best friends whose 
age correspond to a parent-child relationship with the same width of the age 
interval a f e [a - 25 - Aa, a - 25 + Aa] U [a + 25 - Aa, a + 25 + Ax]. In Fig. [7] 
we show how the fraction of best friends whose age is either similar to egoV 
own age or differs by about a generation varies with r m j n for Aa = 12.5 and 
Aa = 5. This figure shows clearly that only the fraction of the best friends of 
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Fig. 7 Fraction of male best friends (blue squares) and female best friends (red 
balls) that differ in age from the ego not more than Aa for different values of the 
control parameter r m i n . Purple circles and black squares provide the fraction of 
female and male best friends, respectively, whose age differs from ego's own age by 
25 ± Aa. On the left hand side panels (a, b, c and d) Aa = 12.5 while on the right 
hand side panels (e, f, g and h) Aa = 5. Panels a and e corresponds to 25 years 
old male egos, b and f: 25 years old female egos, c and g: 50 years old male egos, 
d and h: 50 years old female egos. 



similar age and opposite gender to that of the ego increases with r min . This 
tendency is more clearly observed for younger subscribers, which seems to 
agree with our findings of gender preferences in section [3] 



5 Conclusions 

In this study the problem of identifying the closest relationships within a so- 
cial network based on a large MPC dataset has been considered. This prob- 
lem is challenging, both because in reality human communication is multi- 
channeled varying from person to person and because the MPC dataset gives 
us scope of only two channels of communication with measurable features like 
frequency and duration. In this study we have shown that the ranking order 
of close relationships defined by using the frequency of calls does not always 
coincide with the ranking order defined by using the duration of calls as the 
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measure of closeness in the relationship. In order to overcome this problem 
we defined a single additional parameter (r m - m = 711/112), which allowed us to 
effectively single out those subscribers for whom the closest relationships can 
be reliably identified. Increasing the level of reliability in identifying the best 
friend by restricting egos with the control parameter r m ; n we corroborated 
the previously obtained results, namely, the special status of the "best friend" 
and the gender differences in communication with him or her. These findings 
are also reflected as changes in the age structure of close relationships. 
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